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1. INTRODUCTION 

The state of privacy condition in communication is at risk in this current global era. The presence of 
intruders in the communication technology is possibly qualified anyone to retrieve information without 
permission. The irresponsible intruders may disclose the information to uninvolved parties to check or 
modify it for abusing that information [1]. In order to increase interest in personal conversations by third 
parties using the open source system on Internet, it is essential to take further measures to confirm the 
protection properly. One of the resolutions in dealing with the issue is steganography. 

Steganography is one of the sub-disciplines of information hiding that is defined as the art and 
science of covering message so that the existence of a message has not been discovered. It becomes the 
proper options for users to share information without any known by any other party so only sender and the 
exact receiver are able to know the information. The implementation of steganography is able to hide the 
information in several medium such as text, audio, video, image [2]. The categories of steganography show 
in Figure 1. 

Following Figure 1, this paper is considering the implementation of steganography using the text as 
medium to cover hidden message. This type of steganography is known as text steganography, which focuses 
to conceal messages by manipulating the element in text such as, word, space, line and other characters in the 
sentence of text [3]. 
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The development of techniques in text steganography consists of two kinds of techniques; word-rule 
based and feature-based. Word-rule based is the technique that embeds the hidden message based on word 
pattern by shifting in the text. The word-rule based techniques consist of line-shift coding and word-shift 
coding. For the line-shift coding, it hides the hidden message with a vertically shifting hidden message in text 
lines. Whereas word-shift coding hides the hidden message with horizontally shifting the hidden message in 
the length between words [4]. 
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Figure 1. Major category of steganography 


Technical 
Steganography 


For feature-based technique, it alters the feature of the letter by manipulating in the shape, size, and 
position of the font in the text. The feature-based technique makes the reader difficult to recognize the hidden 
information in this text [5]. Then, due to the characteristic of this technique, feature-based can be used by 
many researchers based on the character of language in the world. Based on current effort research about 
feature-based technique, it consists of two categories; language based that can be used only in certain 
language and letter based that can be used in any language that uses A until Z letter. The classification of 
word-rule based and feature-based as text steganography shows in Figure 2. 
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Figure 2. Major category of implementation of steganography 


Figure 2 has shown the category of text steganography that is the focus of study in this paper. It 
classified any kind of techniques in word-rule based and feature-based based on category, then identify the 
issue in the technique. 


Indonesian J Elec Eng & Comp Sci, Vol. 11, No. 2, August 2018 : 747 — 754 


Indonesian J Elec Eng & Comp Sci ISSN: 2502-4752 i) 749 


2. EVALUATION REVIEW IN TEXT STEGANOGRAPHY 

Essentially, the critical issue on the existence of text steganography method is easily discovered [6]. 
The detectors can discover the hidden messages exist in the text, if they notice the changes of the letter in the 
analysed text [7]. Also, the hidden messages which have been embedded will be lost, if the detector changes 
the text [8]. The detectors can do so if changes are done to the text such as deleting some letter or words, 
modifying its content, retyping, and other changes that directly make the hidden message vanish in the text. 

According to this issue, there are two problems discovered by previous researchers in letter-based 
technique, which are the lack of effectiveness in developing the text steganography method [9] and low 
security for covering text in order to hide hidden message [10]. Thus, this paper concerns how the influences 
of the effectiveness and security in term of development of text steganography implementation. 


2.1 Effectiveness of Steganography Performance 

The effectiveness performance is the indicators to aim achievement based on some measurement 
such as appropriateness, access system and quality performance [11]. In term of comparison, effectiveness is 
a claimed the appropriate result in on performance system has improved than other previous systems. In other 
words, if the quality performance is better than others are, it means the system used is effective and more 
deserved to be used in a specific application [12]. 

In text steganography, effectiveness is critically important factor to review in order to develop the 
technique [13]. It is because steganography can be the options for processing in securing messages. 
Therefore, it has to recognize further that steganography is effective to use in securing messages as part of 
information hiding. Specifically, for text steganography because covering message in text domain is more 
useful than other medium [9]. Those are viewing from most of important information in business and 
education usually is in domain of text [13]. 

Therefore, one of considering of this paper is about reviewed the performance of text steganography 
that measure that relate with effectiveness of development technique from the previous researchers effort. 
The effectiveness of text steganography performance is be viewed in side of accuracy in hide the hidden 
message, quality performance and appropriateness development technique. 


2.2 Security of Steganography Performance 

Steganography as a part of information security also influences in those aspects in security [14]. 
Securing the hidden message that embeds in the text to transmit toward the exact receiver is the significant 
point in text steganography. Therefore, rule of security in text steganography is so critical in process 
development of text steganography [15]. In security performance, confidentiality, integrity and availability of 
information has turn into the inevitability for the development and implementation for system process [16]. 
Then, the security in information hiding has major issue about existences of hidden message for sending 
information that can called availability of data [17]. 

This paper considers about security fundamental in text steganography in term of confidentiality, 
integrity and availability as the data securing information. The implication of three data information security 
has illustrated in Figure 3. 
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Figure 3. Fundamental category of security towards iimplementation of text steganography 
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Based on Figure 3 is plainly showed the considering implementation security in text steganography. 
This paper figure it out influencing security based on confidentiality, integrity or availability in maintain that 
in process securing this time. 

In short, this paper classifies the several techniques in word-rule based and feature-based as 
category text steganography. Then, it identify the issue of each technique in term of effectiveness in 
development the technique and the security in order to cover hidden message as the text steganography 
technique. 


3. WORD-RULE BASED TECHNIQUE 

The implementation of word-rule based technique is divided into two ways of hiding the message. 
The first technique, line-shift coding can be embedded vertically hidden message to conceal the message in 
the text. Meanwhile second technique, word-shift coding can embedded horizontally the hidden message to 
conceal the message in the text. The evaluation performances of word-rule based technique is based on the 
effectiveness and security performance. In security performance is clearly seem divided in to confidentiality, 


integrity and availability. The evaluation of word-rule base shown in Table 1. 


Table 1. Word-rule based technique implementation 





No. — Technique 


Justification 


Evaluation 





Line-Shift Coding 
1; Shifting second line 
[14] 


2: Line-shifted up and 
down [19] 

Word-shift coding 

3. Unique shaped [5] 


4. Word-shift coding 
[14] 


5 Alteration the length 
of image word [15] 


6 Neighbour difference 
[19] 


7. Distribution white 
space [20] 


8. Encode matrix for 
instant message (IM) 
[21] 

9. Integrated inter 
character [22] 

10. Advanced encryption 
standard (AES) 
algorithm [23] 


Shifted 1/300 inches the second line in each page up 
and down 


Embed of differential and between baseline and two 
centroids of adjacent lines. 


Develop the unique shaped in some degree of text 
shifted vertically 

stored the horizontal shifting code in several words 
that conserves the state and able to be apply in text 
file and pictures 

hidden the secret by shifting horizontally the words 
and text image can use this method based on 
alteration the length of images word 

The technique divided the different length of two 
consecutives to embed the hidden message when 0 
bit for unchanged space and 1 bit for changed 
neighbour difference 

justifies the document and not shifted the first and 
last word on each line of text, then distributed white 
space used attribute word spacing 

Encode of matrix based on online chat and inferior 
of encoded shipping adjacent letters for hidden 
message in plain English text. 

Adjusts the characters and words in document and 
distributed both of sides randomly. 

represents binary form the secret data to accomplish 
the process and small quantum increasing or 
decreasing the covers distance depend on the 
conforming amount bit in secret binary data 


In security (availability): this technique 
easy to attack that make lost existences of 
hidden message 

In security (integrity): Easy to detect hidden 
message 


In security (confidentiality): Low security 
technique in embed hidden message 

In security (confidentiality): Easy to detect 
hidden message 


In effectiveness: Consuming much time 
processing 


In effectiveness: complex process 
development in embedding and extracting 
in hidden message 


In effectiveness: limited portion to hide 
information. 


In effectiveness: redundancy in hiding 
message easily happened 


In effectiveness: limited portion to hide 
information. 

In effectiveness: low and complex 
algorithm performance 


Based on Table 1, word-rule based technique can develop with other additional techniques. Those 
techniques have own process of embedding and some drawback that relate with effectiveness and security 
performance in implementation of this technique: 

1. Interm of effectiveness performance, in word-rule based has taking a long time in process embedding in 
covering hidden message [5]. Then, the technique also has issue in process development process has 
complex embedding and extracting, redundancy happened and has limited portion for covering message 
[15, 19, 20, 21, 22, 23] that possibly influencing the quality performance of effectiveness technique. 

2. Interm of security the word-rule based technique has evaluation drawback in confidentiality that claim 
has low security in embedding that possible can be detect by third party [5, 14]. In availability some 
techniques is easy to attack that possibly make hidden message lost after embedding process [14]. 
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4. FEATURE-BASED TECHNIQUE 

Feature-based method alters the feature of letter by manipulating in the shape, size, and position of 
font in the text. Feature-based technique makes the reader difficult to recognize the hidden information in this 
text [5]. Then, due to the characteristic of this technique, feature-based has used by many researchers based 
on character of language in the world named language-based. In this paper, the language-based 
implementation has classified such as English, Arabic, Persian, Chinese and Hindi languages. Then, the 
technique of feature-based text steganography that used in any language using A-Z letter named letter-based. 
Besides, the classification of feature-based technique based on the evaluation performance of effectiveness 


and security (confidentiality, integrity and availability) that shown in Table 2. 


Table 2. Feature-based technique implementation 





No. 


Technique 


Justification 


Evaluation 





Language-based (English language) 
1; 


Machine translation 
[24] 


Translates the transmitted text but allow the source 
to keep on secret. The sender and receiver used key 
idea to hide information behind translation-based 
steganography in natural language translation. 


In effectiveness: High possibility error 
encode algorithm 


2: Mark up letter Markup letter resolve the secret bits to recover In security (integrity): Low security 
[25] distance of hidden message. embeds the file with technique in embed hidden message 

has no unwanted symbols in secret information 

3. Secret Substitute vowels and consonant based on In effectiveness: Dependable with vowel 
steganography code — grammatical sequence for cover text and then words. 
for embedding hidden message put in indefinite article for 
(SSCE) embedded in mapping information from non- 

[26] particular English. 

4. Right-to-Left Hide the information this method convert to binary In security (confidentiality): Low 
Remark and Left- bits while 00 bits for add nothing, 01 bits for ad flexibility that executed by appropriate 
to-Right remark Right-to Left Remark (U200F), 10 bits for Left-to- user 
[27] Right Remark (U200E), and 11 bits combination of 

U200F and U200E 

5: Deoxyribonucleic Represent binary bit 00, C for binary bits 10, G for In effectiveness: Limited scope 
Acid (DNA) binary bits 01, and T for binary bits 11 transform requirement process embedding 
steganography DNA sequence into chipper text using look-up table. 

[9] 

6. Right remark, Left The algorithm used stego key and input carrier for In effectiveness: Depending with others 
remark, Zero width created symbol table and binary representation for technique (ASCII and Unicode) 
joiner, and Zero hidden bits to embed in the text 
width joiner 
[28] 

rE Encryption with Reorders eight bit random key with 0 bits describes In effectiveness: complex process 


Cover Text and 
Reordering (ECR) 
[29] 


cover text and | bit describes encrypted text. 


Language-based (Arabic & Persian language) 
8. 


Letter point in novel 
Arabic 


uses binary bits when | bits are pointed letter with 
extension in Arabic character and for unpointed 


development in embedding and 
extracting in hidden message 


In effectiveness: Only the characters with 
extension can embed the stego text. 


[30] letter hold 0 bits with extension 
9. Vertical Encode the number bits while the 0 bit encode for In security (availability): Retyping 
displacement of the —_ letter without the point or dot and 1 bit for encode remove the hidden message 
point letter with the point. 
[31] 
10. Reversed Fatah Uses reversed Fatah to embed the hidden message In security (availability): Retyping 
[32] that convert into binary bits. remove the hidden message 
11. Secret key table Uses vowel letters to connected consonant of word In security (confidentiality): Low 


encryption (SKT) 
and Characteristic 
Code Mapping 
(CCM) 

[33] 


and designed part of punctuation to hide the 
information 


Language-based (Chinese language) 


security technique in embed hidden 
message 


12. Rectangular region Uses simple algorithm that based on content and In effectiveness: Medium letter for 
[34] method based on mathematical expression and embed is limited. 

automatic generation in Chinese letter. 

13. Simple substitution Changes the traditional form corresponds to 1 bits In security (confidentiality): Low 
method (SSM) and and the unchanged feature correspond to 0 bits. It security technique in embed hidden 
high efficient assumed the stego message based on bit what it is 0 message 
substitution method __ bits or 1 bits and extracted the feature if appear in 
(HESM) substitution dictionary. 

[35] 
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Table 2. Feature-based technique implementation (con’t) 





Language-based (Hindi language) 





14. 


15. 


16. 


17. 


18. 


19. 


Binary string 
[35] 


Specific matra 
[37] 
Numerical code 


[38] 


Finite state machine 
(FSM) 
[39] 


Chain code using 
ANOVA 

[40] 

Hindi Text using 
Matraye 

[41] 


Letter-based 


20 


21 


22 


23 


24 


Change alphabet 
letter pattern 
(CALP) 

[6] 

Curve subheading 
(CURVE), vertical 
straight line 
(VERT), and 
quadruple 
characterization 
[7] 

Back end interface 
web page 

[42] 


Compression ratio 
in Email 

[43] 

Huffman 
Compression in 
Email Based 

[44] 

Alphabet Pairing 
Text 

[2] 


Uses longest common subsequence so to hide the 
message with minimal modification the feature of 
letter. 

The hidden message and translated length of 
compressed to binary bits containing matra 

Every letter in this technique encode based on 
represent four binary bits and has to consider the 
first letter to decode the information by chipper 
phrase. 

Embed the binary string with the Finite State 
Machine (FSM) to describe alteration function and 
transformable symbols in every category. To embed 
binary 0 bits select next symbol of transformable 
from same category, otherwise select next symbol of 
transformable does not same and apply the 
transformation 

Translate code into the number signified one pixel to 
next pixel converts part of the chain and 
concatenated to form a chain. 

The vowel and matra add to consonant in Hindi 
letter, while matra itself ad in bottom and top of the 
consonant. The bottom matra hide 0 bits and the top 
matra hide | bits. 


Hidden message embedded based on binary bits 
categorization those are; the letter I and j are 
character in alphabet have pointed are embedded in 
0 bits and | bits embedded in character a, c and A 
Embed the message based on form of letter from the 
three techniques. Two techniques (CURVE and 
VERT) convert into 1 and 0 bits, while other 
technique (Quadruple) converts into 00, 01, 10, and 
11. 


Hiding the information based on HTML attributes 
using binary bit when | bits for two sequent 
attributes of same tags and 0 bits for two sequent 
attributes of different tags. 


Adds some random letter before the ‘@’ symbol of 
email ids to magnify randomness 


Set the hidden message in four fonts before ‘@’ 
symbol cover the hidden message after ‘@’ in email 
address. 


Arrange with across the message that consists of 4 
parts that have 2 binary bits 


In effectiveness: Limited medium to 
embedding hidden message 


In security (confidentiality): Low security 
technique in embed hidden message 

In security (availability): Retyping remove 
the hidden message. 


In effectiveness: Algorithm complex and 
consuming much time 


In security (confidentiality): Low security 
technique in embed hidden message 


In effectiveness: Complex classification of 
dividing letters. 


In effectiveness: Dependable with number 
of letter used convert 


In security (confidentiality): Easy to detect 
hidden message. 


In security (confidentiality): easy to detect 
with discover the HTML code 


In effectiveness: Only use for email 
environment 


In effectiveness: Only use for email 


environment 


In effectiveness: Algorithm Complex and 
Depending with ASCII. 





1. 


5. 


Based on Table 2, it is showed the list of feature-based technique that analysed the performance in 
term of effectiveness and security: 


CONCLUSIONS 


In term of effectiveness performance feature-based techniques, the review of evaluation has some 
drawback such as it has problem in algorithm development [25], [36], [38], dependable with some 
requirement [6, 28, 30], and some technique only able to use in some environment [43,44] Those are 
evaluation it is possibly influencing the quality performance of effectiveness technique. 

In security, the evaluation performance of feature-based technique has quite similar with word rule based 
like confidentiality that claim low security and easy to detect of hidden message exist in the text [7, 27, 
33, 35, 41]. Then, in security about viability component the existences critical has the problem while 
retyping process make the hidden message lost or do not exist the text [31, 32,38]. 


This paper is evaluating the category of the text steganography that consists of two groups that are 
word-rule based and feature-based techniques. It identifies both techniques where the word-rule based 
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consists of line-shift coding and of word-shift coding. Meanwhile, the feature-based technique comprises of 
two categories that are language-based and letter-based. This paper evaluates the performance of those 
techniques in terms of effectiveness and security. It is because the evaluation of effectiveness is useful to 
discover proper development process of technique. On the other hand, the security’s performance are consists 
of confidentiality, integrity, and availability which have been evaluated in order to analyse the existences oh 
hidden message. Thus, in future work, it is anticipated that a text steganography technique will be developed 
which has proper performance based on evaluation of the effectiveness and security. 
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